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An ADPCM Approach to Reduce 
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(Manuscript received March 14, 1979) 

A new ROM-oriented adpcm architecture for processing fi-law en- 
coded speech is presented. The architecture is shown to offer: 
(i) simple hardware realization, (ii) flexibility of algorithm modifi- 
cation, (Hi) fixed hardware complexity with respect to numerous 
nonlinear processings, and (iv) excellent time-sharing capability. 
Performance measurements of a hardware implementation are also 
included. 

I. INTRODUCTION 

Bit rate reduction from the present 64 kb/s per trunk is of interest 
both to reduce per-trunk cost of existing and planned digital transmis- 
sion facilities and to prove-in digital transmission. 

Adaptive differential pcm (adpcm) coding of the type reported in 
Refs. 1 through 4 appears promising for bit rate reduction of speech 
signals. Our specific interest is the bit rate reduction of the conven- 
tional /x-255, 64-kb/s signal. The purpose of this paper is to present an 
adpcm architecture that offers: (i) simple hardware realization, 
(ii) flexible algorithm modification, {Hi) fixed hardware complexity 
with respect to numerous nonlinear processings, and (iv) time-sharing 
capability over many trunks. The performance of a hardware realiza- 
tion of such a coder, which is shared by 24 voice trunks, is also 
reported. 

II. THE PROPOSED ADPCM ARCHITECTURE 

2. 1 Basic structure 

dpcm with adaptive backward quantization and fixed one-tap pre- 
dictor, as proposed and analyzed in Refs. 1, 2, and 3, is our point of 
departure. Figure 1 is the conventional block diagram of an adpcm 
codec and its interface to a /x-law environment. The quantizer with 
scale (e.g., uniform step size) A is denoted by Q A , while the "inverse 
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quantizer" which maps a quantization interval code IQ into a quantized 
signal level is denoted by Qt 1 . The quantization scale A(i) is deter- 
mined by a scale adaptation algorithm from the quantization index 
sequence IQ(i)*, which is sent over the channel. We add a prime to 
the decoder variables. 

Prior research on adpcm assumes a fixed linear signal representation. 
In the case of the /x-law signal environment, this requires ju-to-linear 
(fi/L) and L//n conversions, as shown in Fig. 1. The 14 bits per sample 
required by the linear representation is an indication of the complexity 
of the straightforward approach. 

Our basic approach is to put as much of the adpcm algorithm as is 
presently practical into read-only memory (rom). This allows us to 
confine the 14-bit linear representation to the rom table computation 
(i.e., to the firmware). We motivate our architecture by relating it to 
the more familiar architecture of Fig. 1. For this purpose, assume a 
uniform fi-bit quantizer which can be considered to be a cascade of an 
infinite amplitude quantizer and a B-bit limiter. The argument to 
follow could be generalized to a nonuniform quantizer. (Note that the 
limiting is on the quantizer output code as opposed to input amplitude.) 
Retaining the limiter in its place but shifting the quantizer and inverse 
quantizer past the difference and summing nodesf produces Fig. 2 
from Fig. 1. (For convenience, only a local decoder is shown together 
with the new notation.) To complete the transition to the proposed 
architecture as shown in Fig. 3, we first combine \l/L and Q A into one 
block for /li/apcm conversion via a rom realization. We next note that, 
instead of Xl , we could store any other absolute representation of the 
local decoder estimate; in particular, we use the /t-law representation. 
Combining QX l and L/fi into an apcm/ju block and combining the /i/L, 
predictor multiplication and Q a into a ([i/apcm)a block, we obtain the 
proposed architecture shown in Fig. 3 where /x/apcm, APCM//1, 
([i/apcm)a , and scale adaptation are to be implemented with ROMs. 

The effects of these actions are discussed below. By moving the 
adaptive quantizers through the difference node, we can combine many 
nonlinear functions, which are difficult to realize in combinatorial 
logic, into rom tables. The generation of these tables can be achieved 
with an off-line computer, where the 14-bit precision inherent in the 
jtz-law code is used. In addition, the outputs of these ROMS can be 
restricted to 8-bit apcm words that can be easily manipulated. We next 
examine in more detail our proposed quantization and scale adaptation 
scheme. 



* IQe{±0, ±1, •••, ±2" ' — 1), where B is the number of bits/sample sent over the 
channel. 

t Note that Q(x + v) at Q(x) + Q{y) to within one quantizer interval. 
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LOCAL 
DECODER 
OUTPUT X„ 



Fig. 2 — Relating a conventional to a proposed adpcm structure. 



2.2 Quantization and scale adaptation details 

The ROM architecture of Fig. 3 permits a very general interpretation 
of scale as scale index d and of Qd and Qd 1 as corresponding arbitrary 
quantizers. In this general framework, the adaptation rule of Refs. 1, 
2, and 3 evolves to the following heuristic scale index adaptation and 
corresponding quantization constraint: For large (small) magnitude 
prediction error, the index d is increased (decreased) and the corre- 
sponding quantization Qd and Qd 1 is made coarser (finer). The quan- 
tization we propose is a marriage of uniform and /i-255 quantization; 
we call it semiuniform quantization. The semiuniform quantization 
and corresponding adaptation strategy is best explained by starting 
with uniform quantization where scale is synonymous to step size of 
the uniform quantizer. 

Denoting the uniform quantizer step by A, the adaptation algorithm 
is 

Mi + 1) = Ui)M(i), (1) 

where M (i) A M{ \ IQ(i) | ) A M\iq U) ( are positive and have the ordering 

Af <Mi< — , < Af^-i.j, 

where 



Mo < 1 and M„ B -\ , > 1. 



(2) 
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In Refs. 2 and 5, optimized adaptation vectors 

M=(Mo,M u .-..M^-i.,) 

are proposed and related to quantizer input variance estimation. It will 
be convenient to work with the log transformed version of (1) 

d(i + 1) = d(i) + m(i), (3) 

where A = Aoq d and M = q m for some convenient fixed Ao and base q. 
The scale adaptation can thus be done with the simpler iteration (3), 
while the translation from d to A (rf/A) is done in the rom generating 
software, as indicated in Fig. 3. We next discuss the finite precision 
aspects of a digital realization and the associated ROM size implications. 
Consider first the scale adaptation algorithm. We assume rf to be a 
K-h\t nonnegative integer, < d < 2* — 1, representing 2 K permissible 
scales. Thus, A m i n = A and A max = A <7 <2 _1) . To permit the adpcm 
coder to approximate the idle channel performance of the /t-law coder, 
we let the minimum step size A equal the first chord /i-law step size 
which, for the 14-bit linear representation we use, corresponds to Ao 
= 2. To accommodate the 40- to 50-dB dynamic range of a speech 
signal, it is desired that 



»mnx 



- --«•"-> (4) 

be between 100 and 300. The step-size resolution factor q should not 
be greater than 2, which is the /t-law resolution; nor is much to be 
gained by making q too close to 1. A useful design range is qe[2 l/4 , 2] 
and Ke[3, 5] with a smaller q requiring the larger K. Figure 3 assumes 
K = 5, although a smaller value might be sufficient. Figure 4 is a block 
diagram for the scale adaptation. We show a scale adaptation prepro- 
cessor where the adaptation increment index IA is based on the 
quantizer output past history such as min-max, averaging, or other.* 
For concreteness, we assume IA to be the three most significant 
magnitude bits of IQ. We have introduced F additional fractional bits 
for the internal scale index which we designate by di. The scale 
adaptation iteration is 



did + 1) = 



if di(i) + m(i) < 
did) + m(i) otherwise (5) 

rfmax if dl(i) + m(i) > rfmax, 



* In the hardware realization we have found useful a two-word memory preprocessor 
where the most significant previous \IQ\ bit in addition to the two current most 
significant | IQ | bits determines the current IA. 
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Fig. 4— Scale adaptation block diagram. 

where di and m(i) have it-integer plus F-fractional-bit representation 
and rf m ax < log (/ (16,000/2 /J_1 ) (where the upper bound on rf ma x corre- 
sponds to the step size necessary to accommodate the largest possible 
prediction error (16,000) without overload).* The F fractional bits are 
useful for two reasons. First, they permit adaptation design flexibility 
even with q = 2; for example, with q = 2 and F = 0, the largest step- 
reducing multiplier (closest to I) is 2" 1 = 0.5 while with F = 3 the 
largest step-reducing multiplier is 2~ 0125 « 0.9. Second, they are useful 
in the control of adaptation mistrack due to channel errors. 6,7 Figure 
4 assumes K = 5, F = 3, and up to 8 distinct adaptation multipliers 
implying a 2" X 8 ROM. 

Consider next the quantization. For strictly uniform quantization, 
large amplitude, slowly varying adpcm input signals would generate 
small step sizes and large magnitudes for Xapcm, Xapcm, and Xp^pcm- 
These large magnitudes might require a large number of bits to avoid 
amplitude overload. The amplitude overload problem as a function of 
the number of bits/word (<14) must be assessed. We propose to avoid 
the amplitude overload problem with the following semiuniform quan- 
tization scheme. For each scale index d, there is a threshold value T(d) 
such that for X L < T(d) the quantization is uniform, while for X L > 
T(d) the quantization is a copy of the fi-law quantizer as illustrated in 
Fig. 5. The threshold T(d) is located at approximately a /x-law segment 
boundary such that, for the segments below T(d), the segment step 
sizes are < A = Ao9 rf while, for the segments above T(d), the segment 
step sizes are > A. Thus wasteful quantizer level assignments (and 
therefore bits) are avoided when the final jn-law quantization would 
ignore them anyway. Figure 6 shows the details of uniform and 
semiuniform quantization assuming A = 3. The important feature of 
the semiuniform quantization is that the number of quantizing inter- 



* For example, for q = >/2 and B = 4, d m «, = 22 and therefore K = 4 with dm„ = 15 
or K = 5 with rf ma x€[16, 22] should be satisfactory. 
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Fig. 6— Illustrating uniform and semiuniform quantization. 
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vals is <256 and therefore 8 bits is sufficient for the representation of 
Xapcm, Xapcm, and Xp.apcm- The blocks /x/apcm, apcm//i, and 
(/x/apcm)/a including a sign magnitude/2's-complement or inverse-con- 
version can each be realized with a 2 1:l X 8 ROM, or alternatively one 
can use 2 12 X 7 roms with the sign bit feeding around the rows to 
separate sign magnitude/2's-complement converters. 

Although the optimizations reported in Refs. 2, 3, and 5 do not 
quantitatively apply to our adpcm coder, the qualitative concepts 
developed there with respect to loading, adaptation speed, and others 
are very useful in tuning our coder, and they could be utilized in our 
hardware realization. 

The next section briefly discusses two approaches to dealing with 
transmitter-receiver scale mistrack due to channel errors. The discus- 
sion serves to illustrate the versatility of our ROM architecture. 

III. CHANNEL ERROR AND SCALE MISTRACK 

In a backward adapting quantizer, the possibility of scale mistrack 
due to channel errors must be considered A robust quantizer is 
obtained by modifying (3) [and similarly modifying (5)] to 

d(i + 1) = fid(i) + m(i), (6) 

where /? is some number less than but close to 1. The modification in 
(6) is incorporated into our architecture by a change of the scale 
adaptation ROM. References 6 and 7 discuss in detail performance 
aspects and design guidelines related to the finite arithmetic imple- 
mentation of (6). Also, the improved quantizer reconstruction strategy 
recommended for the robust quantizer in Ref. 8 can be implemented 
by a change in the apcm//i reconstruction table, i.e., the apcm//x rom. 
These improvements require no additional hardware. 

An alternate approach to scale mistrack control is to modify (3) to 

did + 1) = d,(i) + m(i, dj(i)), (7a) 

where 

(m(i)-2- F D 2 <d I (i) 
m(i, did)) = \ m(i) Z>, < d,(i) < D 2 (7b) 

[m(i) + 2 F rf/(i)<D,, 

m = log,, M is an optimized adaptation vector in the absence of channel 
errors, and D\ and D 2 (e.g., D\ = (l/3)rf max and D 2 = (2/3)rf max ) are 
design constants which, for the reason to follow, we call mistrack 
correction levels. Each iteration that di(i) and di(i) are on opposite 
sides of n {n = 1 or 2) of the correction levels, the mistrack | di(i) — 
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Fig. 7 — adpcm coder performance measurements. 

d'i(i) | is reduced by n2~ F . For* 2~ F <^|m,nin| the approach in (7) 
should avoid the error-free dynamic range penalty 6,7 associated with 
(6). Again, the important point is that (7) (or a generalization of it to 
an arbitrary number of correcting levels) can be incorporated into the 
codec by a simple ROM table modification without hardware or com- 
plexity penalty. 

IV. HARDWARE REALIZATION AND PERFORMANCE 

We have constructed an adpcm coder which interfaces directly with 
a 2)3 bank. Because our coder uses parallel processing and the roms 
are relatively fast, we can share the coder among all 24 voice trunks of 
the channel bank. In fact, over 120 trunks could be accommodated if 
we so desire. The coder is implemented with three wire-wrapped 
circuit packs: one each for the interface, the transmitter, and the 
receiver. 

For the signal-to-noise measurements, we chose a prediction coeffi- 
cient A = 0.8 and a scale adaptation vector of (—1, 0, 0, 0, 1, 2, 3, 5) 
with a base q = \/2. Because of the flexibility of the coder, we were 
able to vary the number of bits per sample feeding the transmission 
channel as well as the local decoder. The measurement arrangement 
is shown in Fig. 7. The signal-to-noise versus frequency for the coder 
is shown by the graph of Fig. 8. The measurements were made with 
the coder operating in the 5-bit adpcm mode and with 5 bits in the 



* If it is desired to have a small m such as m = ±2~ F or m = 0, then (7b) can be simply 
modified to exclude the perturbation 2~ F for those m. The ROM modification is again 
straightforward. Also, interleaving as in Ref. 6 should reduce the required F. 
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feedback circuit. The analog input was -15 dBmO. As expected, the 
coder performance drops with increased frequency. Above 1500 Hz 
when the adjacent sample correlation is below 0.4, the differential 
mode of coding has no advantage and in fact has penalty. The remain- 
der of the performance tests were made by measuring the signal-to- 
noise ratio versus amplitude for 1005 Hz. These measurements were 
made for the following coder options: 

U) adpcm with the number of transmit bits equal to 5 and the 
number of feedback bits equal to 5, 4, and 3. 

(ii) adpcm with the number of transmit and feedback bits both 
equal to 4 and 3. 

Figure 9 demonstrates the performance insensitivity as the number 
of feedback bits is dropped. Figure 10 shows the performance for 3 and 
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Fig. 8— adpcm coder performance, frequency response. 
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Fig. 9 — adpcm coder performance, 5-bit. 
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Fig. 10 — adpcm coder performance, 4- and 3-bit. 
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4 bits. Experimental and simulation results for 1004 Hz coding with 4- 
bit adpcm are compared in Fig. 11. 
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